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A method of producing a polypeptide product end a plasmidlc expression vehicle therefor, a method of creating an 
expression plasmld, a method of cleaving double stranded DNA, and specific plasmlds. 



@ Novel plasm idic expression vehicles and methods of 
using them in the production of useful polypeptides by 
recombinant bacteria are described. The plasmids employ a 
tryptophan promoter-operator system from which the 
attenuator region ordinarily present has been deleted. Bac- 
teria containing- the plasmids can accordingly be repressed 
by the addition of tryptophan against expression of desired 
polypeptides coded for by inserted genes while they are 
grown to levels suitable for industrial-scale production. 
01 Additive tryptophan may then be withdrawn, essentially 
derepressing the pathway and permitting efficient produc- 
tion of the desired product in high yield. 
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A METHOD OF PRODUCING A POLYPEPTIDE PRODUCT 
" AND A PIASMIDIC EXPRESSION VEHICLE THEREFOR, 
A METHOD OF CREATING AN EXPRESSION PIASMID, 
A METHOD OF CLEAVING DOUBLE STRANDED DN&, 
5 AND SPECIFIC PIASMIDS. 



BACKGROUND OF THE INVENTION 

10 • With the advent of recombinant DNA technology, the controlled 

bacterial production of an enormous variety of useful polypeptides has. 
become possible. Already in hand are bacteria modified by this 
technology to permit the production of such polypeptide products such as 
somatostatin (K. Itakura, et all, , Science 198, 1056 [1977]), the 

15 (component) A and B chains of human insulin (D.V, Goeddel, et a/L , Proc 
Nat'l Acad Sci, USA 76, 106 [1979]), and human growth hormone (D.V. 
Goeddel, et al., Nature 281, 544 [1979]). More recently, recombinant 
0.NA techniques have been used to occasion the bacterial production of 
thymosin alpha 1, an immune potentiating substance produced by the 
20 thymus. Such is the power of the technology that virtually 
any useful polypeptide can be bacterially produced, putting 
within reach the controlled manufacture of hormones, 
enzymes, antibodies, and vaccines against a wide variety 
of diseases. The cited materials, which describe 
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in greater detail the representative examples referred to above, are 
incorporated herein by reference, as are other publications referred to 
jnfra, to illuminate the background of the invention. 

The work horse of recombinant DNA technology is the plasmid, a 

5 non-chromosomal loop of double-stranded DNA found in bacteria, 
oftentimes in multiple copies per bacterial cell. Included in the 
information encoded in the .plasmid DNA is that required to reproduce the 
plasmid in daughter cells (i.e.,* a "replicon") and ordinarily, "one or 
more selection characteristics, such as resistance to antibiotifcs, which 

10 permit clones of the host cell containing the plasmid of interest to be 
recognized and preferentially grown in selective media. The utility of 
bacterial plasmids lies in the fact that they can be specifically 
cleaved by one or another restriction endonuclease or "restriction 
enzyme 0 , each of which recognizes a different site on the plasmidic 

15 DNA. Thereafter heterologous genes or gene fragments may be inserted 
into the plasmid by endwise joining at the cleavage site or at 
reconstructed ends adjacent 'the cleavage site. As used herein, the term- 
"heterologous" refers to a gene not "ordinarily found in, or a 
polypeptide sequence ordinarily not produced by, E_. coli , whereas the 

20 term "homologous" refers to a gene or polypeptide which is produced in 
wild-type coli . DNA recombination is performed outside the bacteria, 
but the resulting "recombinant" plasmid can be introduced into bacteria 
by a process known as transformation and large quantities of the 
heterologous gene-containing recombinant plasmid obtained by growing the 

25 transformant. Moreover, where the gene is properly inserted with 

reference to portions of the plasmid which govern the transcription and 
translation of the encoded DNA message, the resulting expression vehicle 
can be used to actually produce the polypeptide sequence for which the 
inserted gene codes, a process referred to as expression. 

30 Expression is initiated in a region known as the promoter which is 

recognized by and bound by RNA polymerase. In some cases, as in the trp 
operon discussed infra , promoter regions are overlapped by "operator" 
regions to form a combined promoter-operator. Operators are DNA 
sequences which are recognized by so-called repressor proteins which 

35 serve to regulate the frequency of transcription initiation at a 
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particular promoter. The polymerase travels along the ON A, transcribing 
the information contained in the coding strand from its 5' to 3" end 
. into messenger RNA which is in turn translated into a polypeptide having 
the amino acid sequence for which the ONA codes. Each amino acid is 

5 " encoded by a unique nucleotide triplet or "codon" within what «?y for 
present purposes be referred to as the "structural gene", i.e. that part 
Which encodes the amino acid sequence of the expressed product. After 
binding to the promoter, the RNA polymerase first transcribes 
nucleotides encoding a ribosome binding site, then a translation 

10 initiation or "start" signal (ordinarily ATG, which in the resulting 
messenger RNA becomes AUG), then the nucleotide codons within the 
structural gene itself. So-called stop codons are transcribed at the 
end of the structural gene whereafter the polymerase may form an 
additional sequence of messenger RNA which, because of the presence of 

15 the stop signal, will remain untranslated by the ribosomes. Ribosomes 
bind to the binding site provided on the messenger RNA, in bacteria 
ordinarily as the mRNA is being formed, and themselves produce the 
encoded polypeptide, beginning at the translation start signal and 
ending at the previously mentioned stop signal. The desired product is 

20 produced if the sequences encoding the ribosome binding site are 

positioned properly with respect to the AUG initiator codon and if all 
remaining codons follow the initiator codon in phase. The resulting 
product may be obtained by lysing the host cell and recovering the 
product by appropriate purification from other bacterial protein. 

25 Polypeptides expressed through the use of recombinant ONA 

technology may be entirely heterologous, as in the case of the direct 
expression of human growth hormone, or alternatively may comprise a 
heterologous polypeptide and, fused thereto, at least a portion of the 
amino acid sequence of a homologous peptide, as in the case of the 

30 production of intermediates for somatostatin and the components of human 
insulin. In the latter cases, for example, the fused homologous 
polypeptide comprised a portion of the amino acid sequence for beta 
galactosidase. In those cases, the intended bioactive product is 
bioinactivated by the fused, homologous polypeptide until the latter is 

35 cleaved away in an extracellular environment. Fusion proteins like 
those just mentioned can be designed so as to permit highly specific 
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cleavage of the precusor protein from the intended product, as by the 
action of cyanogen bromide on methionine, or alternatively by enzymatic 
cleavage. See, eg., G.B. Patent Publication No. 2 007 676 A. 

If recombinant DNA technology is to fully sustain its promise, 
5 systems must be devised which optimize expression of gene inserts, so 
that the intended polypeptide products can be made available in high 
yield. The beta lactamase and lactose promoter-operator systems most 
commonly used in the past, while useful, have not fully utilized the 
capacity of the technology from the standpoint of yield. A need has 
10 existed for abacterial expression vehicle capable of the controlled 
expression of desired polypeptide products in higher yield. 

Tryptophan is an amino acid produced by bacteria for use as a 
component part of homologous polypeptides in a biosynthetic pathway 
which proceeds: chorismic acid-* anthrani lie acid-»phosphoribosyl 
15 anthranilic acid *■ CORP [enol-l-(o-carboxyphenylamino)-l-desoxy-D- 
ribulose-5-phosphate]-*indol-3-glycerol-phosphate, and ultimately to 
tryptophan itself. The enzymatic reactions of this pathway are 
catalyzed by the products of the tryptophan or "trp" operon, a 
polycistronic DNA segment which is transcribed under the direction of 
the trp promoter-operator system. The resulting polycistronic messenger 
RNA encodes the so-called trp leader sequence and then, in order, the 
polypeptides referred to as trp E, trp 0, trp C, trp B and trp A. These 
polypeptides variously catalyze and control individual steps in the 
pathway- chorismic acid tryptophan. 

25 In wild-type E. coli, the tryptophan operon is under at least three 

distinct forms of control. In the case of promoter-operator repression, 
tryptophan acts as a compressor and binds to its aporepressor to form 
an active repressor complex which, in turn, binds to the operator, 
closing down the pathway in its entirety. Secondly, by a process of 
feedback inhibition, tryptophan binds to a complex of the trp E and trp 
0 polypeptides, prohibiting their participation in the pathway 
synthesis. Finally, control is effected by a process known as 
attenuation under the control of the "attenuator region" of the gene, a 
region within the trp leader sequence. See generally G.F. Miozzari 



20 



30 



-5- 



0036776 



et al., J. Bacteriology 133, l45 7 (1978); The 0 PerO n 263-302, Cold Spring 
Harbor Laboratory (1978), Miller and Reznikoff, eds.; F. Lee et al., 
Proc. Natl. Acad. Sci. USA 74, 4365 (1977) and K. Bertrand et al, J. 
Hoi. Biol. 103, 319 (1976). The extent of attenuation appears to be 
governed by the intracellular concentration of tryptophan, and in 
wild-type E. coH the attenuator terminates expression in approximately 
nine out of ten cases, possibly through the formation of a secondary 
structure, or "termination loop", in the messenger RNA which causes the 
RNA polymerase to prematurely disengage from the associated DNA. 
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Other workers have employed the trp operon to obtain some measure 
of heterologous polypeptide expression. This work, it is believed, 
attempted to deal with problems of repression and attenuation by the 
addition of-indole acrylic acid, an inducer and analog which competes 
with tryptophan for trp repressor molecules, tending toward derepression 
15 by competitive inhibition. At the same time the inducer diminishes 
attenuation by inhibiting the enzymatic conversion of indole to 
tryptophan and thus effectively depriving the cell of tryptophan. As a 
result more polymerases successfully read through the attentuator. 
However, this approach appears problematic from the standpoint of 
20 completing translation consistently and in high yield, since 

tryptophan-containing protein sequences are prematurely terminated in 
synthesis due to lack of utilizable tryptophan. Indeed, an effective 
relief of attenuation by this approach is entirely dependent on severe 
tryptophan starvation. 

25. The present invention addresses problems associated with tryptophan 
repression and attenuation in a different manner and provides (1) a 
method for obtaining an expression vehicle designed for direct 
expression of heterologous genes from the trp promoter-operator, (2)' 
methods for obtaining vehicles designed for expression, from the 

30 tryptophan operator-promoter, of specifically cleavable polypeptides 
coded by homologous-heterologous gene fusions and (3) a method of 
expressing heterologous polypeptides control lably, efficiently and in 
high yield, as well as the associated means. 
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According to the present invention, novel plasmidic expression 
vehicles are provided for the production in bacteria of heterologous 
polypeptide products, the vehicles having a sequence of double-stranded 

5 DNA comprising, in phase from a first 5' to a second 3 1 end of the 
coding strand, a trp promoter-operator, nucleotides coding for the trp 
leader ribosome binding site, and nucleotides encoding translation 
initiation for expression of a structural gene that encodes the amino 
\ acid sequence of the heterologous polypeptide. The DNA sequence referred 

10 to- contains neither a trp attenuator region nor nucleotides coding for 
the trp E ribosome binding site. Instead, the trp leader ribosome 
binding site is efficiently used to effect expression of the information 
encoded by an inserted gene. 

Cells are transformed by addition of the trp promoter-operator-' 

15 containing and attenuator-lacking plasmids of the invention and'grown up 
in the presence of additive tryptophan. The use of tryptophan-rich 
media provides sufficient tryptophan to essentially completely repress 
the trp promoter-operator through trp/repressor interactions, so that 
cell growth can proceed uninhibited by premature expression of large 

20 quantities of heterologous polypeptide encoded by an insert otherwise 
under the control of the trp promoter-operator system. When the 
recombinant culture has been grown to the levels appropriate for 
industrial production of the polypeptide, on the other hand, the 
external source of tryptophan is removed, leaving the cell to rely only 

25 on the tryptophan that it can itself produce. The result is mild 

tryptophan limitation and, accordingly, the pathway is derepressed and 
highly efficient expression of the heterologous insert occurs, 
unhampered by attenuation because the attenuator region has been deleted 
from the system. In this manner the cells are never severely deprived 

30 of tryptophan and all proteins, whether they contain tryptophan or not, 
can be produced in substantial yields. 

The invention further provides means of cleaving double-stranded 
0NA at any desired point, even absent a restriction enzyme site, a 
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technique, useful in, £ ^ 0 ng other things, the creation of trp operons 
having attenuator deletions other than those previously obtained by 
selection of mutants. 

Finally, the invention provides a variety of useful intermediates 
and endproducts, including specifically cleavable heterologous- 
homologous fusion proteins that are stabilized against degradation under 
expression conditions. 

The manner in which these and other objects and advantages of the 
invention are obtained win become more apparent from the detailed 
description which follows and from the accompanying drawings in which: 

Figures 1 and 2 illustrate a preferred scheme for forming plasmids 
capable of expressing heterologous genes as fusions with a 
portion of the trp 0 polypeptide, from which fusion they may 
be later cleaved; 

Figure 3 is the result of polyacrylamide gel segregation of cell 
protein containing homologous (trp D' ) - heterologous 
(somatostatin or thymosin a I) fusion proteins: 
Figures 4, 5 and 6 illustrate successive stages in a preferred 
scheme for the creation of a plasmid capable of directly 
expressing a heterologous gene (human growth hormone) under 
the control of the trp promoter-operator system; 
Figure 7 is the result of polyacrylamide gel segregation of cell 
protein containing human growth hormone directly expressed 
under the control of the trp promoter-operator system- 
Figures 8,9 (a-b) and 10 illustrate in successive stages a' 
preferred scheme for the creation of plasmids capable of 
expressing heterologous genes (in the illustrated case, for 
somatostatin) as fusions with a portion of the trp E. ' 
polypeptide, from which fusions they may be later cleaved-' 
Figure 11 is the result of polyacrylamide gel segregation of cell 
protean containing homologous (trp E) - heterologous fusion 
proteins for the production of, respectively, somatostatin, 
thymosin alpha I, human proinsulin, and the A and B chains of 
human insulin. 
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Figures 12 and 13 illustrate in successive stages the manner in 
which the plasmid created by the scheme of Figures 3-10 
inclusive is manipulated to form a system in which other 
heterologous genes may be interchangeably expressed- as fusions 
with trp E polypeptide sequences. 
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In the Figures, only the coding strand of the double-stranded 
plasmid and linear DNAs are depicted in most instances, for clarity in 
illustration. Antibiotic resistance-encoding genes are denoted ap R 
(ampicillin) and tc R (tetracycline). The legend tc S connotes a gene 
for tetracycline resistance that is not under the control of a 
promoter-operator system, such that plasmids containing the gene will 
•nevertheless be tetracycline sensitive. The legend "ap S " connotes 
ampicillin sensitivity resulting from deletion of a portion of the gene 
encoding ampicillin sensitivity. Plasmidic promoters and operators are 
15 denoted "p" and "o". The- letters A, T, G and C respectively connote the 
nucleotides containing the bases adenine, thymine, guanine and 
cytosine. Other Figure legends appear from the text. 



The preferred embodiments of the invention described below involved 
use of a number of commonly available restriction endonucleases next 
20 identified, with their corresponding. recognition sequences and 
(indicated by arrow) cleavage patterns. - 



25 



30 



Xbal: 



EcoRI: 



Bglll: 



PvuII 



SamHI: 



CTAGA 
AGATCjT 

GAATTC 

CTTAAG 
t 



i 



GATCT 

TCTAGA 
t 

GAGCTG 

GTCGAC 
t 

GGATCC 

CCTAGG 
t 



TaqI: 



Hindlll: 



Hpal : 



PstI: 



TCGA 
AGCf 

AAGCTT 

TTCGAA 
t 

I 

GTTAAC 

CAATTG 
t 

CTGCAG 
GACGTC 

t 
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Where the points of cleavage are spaced apart on the respective strands 
the cleaved ends will be -sticky", ie, capable of reanneaiing or of 
annealing to other complementarily "sticky"-ended DNA by Watson-Crick 
base pairing (A to T and G to C) in mortise and tenon fashion. Some 
restriction enzymes, such as Hpal and PvuII above, cleave to leave 
"blunt" ends. The nucleotide sequences above are represented in 
accordance with the convention usednhroughout: upper strand is the 
protein encoding strand, and in proceeding from left to right on that 
strand one moves from the 5' to the 3' end thereof, ie, in the direction 
of transcription from a "proximal" toward a "distal" point. 

Finally with regard to conventions, the symbol "a" connotes a 
deletion. Thus, for example, reference to a plasmid followed by, say 
"AEcoRI-Xbal" describes the plasmid from which the nucleotide sequence 
between EcoRI and Xbal restriction enzyme sites has been removed by 
15 d lges tion with those enzymes. For convenience, certain deletions are 
denoted by number. Thus, beginning from the first base pair ("bp") of 
the EcoRI recognition site which precedes the gene for tetracycline 
resistance in the parental plasmid pBR322,- w connotes deletion of 
bpl-30 (ie, AEcoRI-Hind III) and consequent disenabling of the 
20 tetracycline promoter-operator system; "a2" connotes deletion of bp 1-375 
(ie, AEcoRI-BamHI) and consequent removal of both the tetracycline 
promoter-operator and the structural gene which encodes tetracycline 
resistance; and "a3" connotes deletion of bp 3611-4359 (ie. APstl-EcoRI) 
and elimination of ampicillin resistance. »a4" is used to connote 
25 removal of bp -900 --1500 from the trp operon fragment 5 (Fig. 1) 
eliminating the structural gene for the trp D polypeptide. 



DETAILED DESCRIPTION 



The trp leader sequence is made up of base pairs ("bp") 1-162 
starting from the start point for trp mRNA. A fourteen amino acid' 
30 putative trp leader polypeptide is encoded by bp 27-71 following the ATG 
nucleotides which encode the translation start signal. The trp 
attenuator region comprises successive GC-rich and AT-rich sequences 
lying between bp 114 and 156 and attenuation is apparently effected on 
mRNA nucleotides encoded by bp -134-141 of the leader sequence To 
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express a heterologous polypeptide under the direction of the trp leader 
ribosome binding site and at the same time avert attenuation, the 
following criteria must be observed: 

1. Base pairs 134-141 or beyond must be deleted; 
5 . 2 « Th e ATG codon of the inserted gene must be positioned in 

correct relation to a ribosome binding site, as is known (see, 
eg.^jh-A. Steitz "Genetic signals and nucleotide sequences in 
messenger RNA" in Biological Regulation and Control (ed. R. 
Goldberger) Plenum Press, N.Y. (1978). 
10 3. Where a homologous-heterologous fusion protein is to be 

produced, the translation start signal of a homologous 
polypeptide sequence should remain available, and the codons 
for the homologous portion of the fusion protein have to be 
inserted in phase without intervening translation stop signals. 
15 For example, deleting all base pairs within the leader sequence 

distal from.bp* 70 removes the attenuator region, leaves the ATG 
sequence which encodes the translation start signal, and eliminates the 
intervening translation stop encoded by TCA (bp. 69-71), by eliminating 
A and following nucleotides. Such a deletion would result in expression 
20 of a fusion protein beginning with the leader polypeptide, ending with 
that encoded by any heterologous insert, and including a distal region 
of one of the post-leader trp operon polypeptides determined by the 
extent of the deletion in the 3' direction. Thus a deletion extending 
into the.E gene would lead to expression of a homologous precursor 
25 comprising the I sequence and the distal region of E (beyond the 
deletion endpoint) fused to the sequence encoded by any following 
insert, and so on. 

Two particularly useful plasmids from which the attenuator region 
has been deleted are the plasmids pGMl and pGM3, G.F. Miozzari et al, 
30 J- Bacteriology 133 , 1457 (1978). These respectively carry the 

deletions trp aLE 1413 and trp aLE 1417 and express (under the control 
of the trp promoter-operator) a polypeptide comprising approximately the 
first six amino acids of the trip leader and distal regions of the E 
polypeptide. In the most preferred case, pGMl, only about the last 
35 third of the E polypeptide is expressed whereas pGM3 expresses almost 
the distal one half of the E polypeptide codons. E. coli K-12 strain 
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W3110 tna 2~trp"al02 containing pGMl has been deposited with the 
American Type Culture Collection (ATCC no. 31622). pGMl may be 
conventionally removed from the strain for use in the procedures 
described below. 

Alternatively, deletions may be effected by means provided by the 
invention for specifically cleaving double-stranded ONA at any desired 
site. One example of this cleavage technique appears from Part IV of 
the experimental section, infra . Thus, double-stranded ONA is converted 
to single-stranded DNA in the region surrounding the intended cleavage 
point, as by reaction with lambda exonuclease. A synthetic or other 
single-stranded DNA primer is then hybridized to the single-stranded 
length earlier formed, by Watson-Crick base-pairing, the primer sequence 
being such as to ensure that the 5' end thereof will be coterminous with 
the nucleotide on the first strand just prior to the intended cleavage 
15 point. The primer is next extended in the 3' direction by reaction with 
DNA polymerase, recreating that portion of the original double-stranded 
DNA prior to the intended cleavage that was lost in the first step. 
Simultaneously or thereafter, the portion of the. first strand beyond the 
intended cleavage point is digested away. To summarize, where "v" marks 
20 the intended cleavage point: 

a ^ — _Y. intended cleavage point "v" 



b ^ — v made single stranded 

around "v" 



25 c) 



primer hybridization 



d ) Y. extension from primer 

^\WV 



e) 



V 



single strand digestion 
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In the most preferred embodiment, steps (d) and (e) are performed 
simultaneously, using a polymerase that simultaneously digests the 
protruding single stranded end in the 3\» 5 1 direction and extends the 
primer (in the presence of dATP, dGTP, dTTP and dCTP) in the 5' » 3 1 

5 "direction. The material preferred for this purpose is Klenow Polymerase 
I, ie, that fragment obtained by proteolytic cleavage of DNA Polymerase 
I which contains the 5' > 3 1 polymerizing activity and the 3' > 5* 
exonucleolytic activity of the parental enzyme, yet lacks its 5 1 » 3 1 
exonucleolytic activity. A. Kornberg, DNA Synthesis, 98, W.H. Freeman 

10 and Co., SFO (1974). 

Using the procedure just described, attenuator deletions may be 
made in any desired manner in a trp operon-containing plasmid first 
linearized by, eg, cleavage at a restriction site .downstream from the 
point at which the molecule is to be blunt-ended ( M v w above). 
15 Recircularization following deletion of the attenuator region may be 
effected, eg, by blunt end ligation or other manners which will be 
apparent to the art-skilled.. 

Although the invention encompasses direct expression of 
heterologous polypeptide under the direction of the trp promoter- 

20 operator, the preferred case involves expression of fused proteins 
containing both homologous and heterologous sequences, the latter 
preferably being specifically cleavable from the former in 
extra-cellular environs. Particularly preferred are fusions in which 
the homologous portion comprises one or more amino acids of the trp. 

25 leader polypeptide and about one-third or more of the trp E amino acid 
sequence (distal end). Fusion proteins so obtained appear remarkably 
stabilized against degradation under expression conditions. 

Bacteria^, coli K-12 strain W3110 tna 2~trp~Al02 (pGMl)., ATCC 
No. 31622, may be used to amplify stocks of the pGHl plasmid preferably 
employed in constructing the attenuator-deficient trp promoter-operator 
30 systems of the invention. This strain is phenotypically trp* in the 
presence of anthranilate and can be grown in minimal media such as LB 
supplemented with 50 ug/ml anthranilate. 
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All bacterial strains used in trp promoter-operator directed 
expression according to. the invention are trp repressor* ("trp R + ") 
as in the case of wild-type E. coll, so as to ensure repression until 
heterologous expression is intended. 

5 DNA recombination is, in the preferred embodiment, performed in 

E. coll, K-12 strain -294 (end A, thi', hsr", hsm + ), ATCC No. ■ 
31446, a bacterial strain whose membrane characteristics facilitate 
transformations. Heterologous polypeptide-producing plasmids' grown in 
strain 294 are conventionally extracted and maintained in solution (eg, 

10 lOmM tris, ImM EDTA,pH8) at from about -20*C to about 4*C. For 

expression under industrial conditions, on the other hand, we prefer a 
more hardy strain, ie, E. coll K-12 x~F~ RV 308 str r , gal 308^ 
ATCC No. 31608. RV 308 is nutritionally wild-type and grows well in 
minimal media, synthesizing all necessary macromolecules from 

15 conventional mixes of ammonium, phosphate and magnesium salts, trace 
metals and glucose. After transformation of RV 308 culture with strain 
294-derived plasmid the culture is plated on media selective for a 
marker (such as antibiotic resistance) carried by the plasmid, and a 
transformant colony picked and grown in flask culture. Aliquots of the 

20 latter in 102 DMS0 or glycerol solution (in sterile Wheaton vials) are 
shell frozen in an ethanol-dry ice bath and frozen at -80'C. To produce 
the encoded heterologous polypeptide the culture samples are grown up in 
media containing tryptophan so as to repress the trp promoter-operator 
and .the system then deprived of additive tryptophan to occasion 
25 expression. 

For the first stage of growth one may employ, for example, LB 
medium (J.H. Miller, Experiments in Molecular Genetics, 433, Cold Spring 
Harbor Laboratory 1972) which contains, per liter aqueous solution, lOg 
Bacto tryptone, 5g Bacto yeast extract and lOg NaCl. Preferably, the 
30 inoculant is grown to optical density ("o.d.") of 10 or more (atSSO 
nM), more preferably to o.d. 20 or more, and most preferably to o.d. 30 
or more, albeit to less than stationary phase. 
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For derepression and expression the inoculant is next grown under 
conditions which deprive the cell of additive tryptophan. One 
appropriate media for such growth is M9 (J.H. Miller, supra at 431) 
prepared as follows (per liter): 

5. KH 2 P0 4 . 3 g 

Na 2 HP0 4 6g 
NaCl o.5g ' 

NH 4 C1 lg 

Autoclave, then add: 

10 10 ml 0.01M CaCl 2 

1 ml 1M MgS0 4 . * ... 

10 ml 202 glucose 

Vitamin Bl lpg/ral 

Humkp hycase amino 
15 or DIFCO cas. amino acids 40 yg/ml. 

The amino acid supplement is a tryptophan- lacking acid hydrqlysate of 
casein. 

To commence expression of the heterologous polypeptide the 
inoculant grown in tryptophan-rich media may, eg, be diluted into a- 

20 larger volume of medium containing no additive tryptophan (for example, 
2-10 fold dilution) grown up to any desired level (preferably short of 
stationary growth phase) and the intended product conventionally 
obtained by lysis, centrifugation and purification. In the 
tryptophan-deprived growth stage, the cells are preferably grown to od 

25 in excess of 10, more preferably in excess of od 20 and most preferably 
to or beyond od 30 (all at 550 nH) before product recovery. 

All ONA recombination experiments described in the Experimental 
section which follows were conducted at Genentech Inc. in accordance 
with the National Institutes of Health Guidelines for Recombinant ONA 
30 research. 
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I. Expression of D-polypeptide fusion protein 



A preferred method of expressing fusion proteins comprising desired 
polypeptides and, fused thereto, a portion of the amino acid sequence of 
the trp 0 polypeptide that is separable in vitro by virtue of a 
5 methionine amino acid specifically sensitive to CNBr cleavage, is 
described with reference to Figures 1-3. 

* 

A. Construction of pBRHtrp 

Plasmid pGMl {I, Fig. 1) carries the E. cp_H tryptophan operon 
containing the deletion ALE1413 (G.F. Miozzari, et aK, (1978) 
• 10 Bacteriology 1457-1486)) and hence expresses a fusion protein comprising 
the first 6 amino acids of the trp leader and approximately the last 
third of the trp E polypeptide (hereinafter referred to in conjunction 
as IE'), as well as the trp 0 polypeptide in its entirety, all under the 
control of the trp promoter-operator system. The plasmid, 20 P g, was 
15 digested with the restriction enzyme PvuII which cleaves the plasmid at 
five sites. The gene fragments 2 were next combined with EcoRI linkers 
(consisting of a self complementary oligonucleotide 3 of the sequence: 
pCATGAATTCATG) providing an EcoRI cleavage site for a later cloning into 
a plasmid containing an EcoRI site (20). The 20 u g of DMA fragments 2 
20 obtained from pGMl were treated with 10 units T 4 .DNA ligase in the 
presence of 200 pico moles of the 5'-phosphorylated synthetic 
oligonucleotide pCATGAATTCATG (3) and in 20 u l T 4 0NA ligase buffer 
(20mM tris, pH 7.6, 0.5 mM ATP, 10 mM MgCl 2 , 5 mM dithiothreitol ) at 
4*C overnight. The solution was then heated 10 minutes at 70*C to halt 
25 ligation. The linkers were cleaved by EcoRI digestion and the 
fragments, now with EcoRI ends were separated using 5 percent 
polyacryl amide gel electrophoresis (herein after "PAGE") and the three 
largest fragments isolated from the gel by first staining with ethidium 
bromide, locating the fragments with ultraviolet light, and cutting from 
30 the gel the portions of interest. Each gel fragment, with- 300 
microliters O.lxTBE, was placed in a dialysis bag and subjected to 
electrophoresis- at 100 v for one hour in O.lxTBE buffer (T3E buffer 
contains: 10.8 gm tris base, 5.5 gm boric acid, 0.09 gm Na 2 £0TA in 1 
liter H 2 0). The aqueous solution was collected from the dialysis 
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bag, phenol extracted, chloroform extracted and made 0.2 M sodium 
chloride, and the ONA recovered in water after ethanol 
precipitation. [All DNA fragment isolations hereinafter described, 
are performed using PAGE followed by the electroelution method just 
discussed]. The trp promoter-operator-containing gene with EcoRI 
sticky ends 5_ was identified in the procedure next described, which' 
entails the insertion of fragments into a tetracycline sensitive 
plasmid 6 which, upon promoter-operator insertion, becomes 
tetracycline resistant. 

8. Creation of the plasmid pBRHtrp expressing tetracycline 
resistance under the control of the trp promoter-operator and 
identification and amplification of the trp promoter-operator 
containing DNA fragment S isolated in (A.) above. 

Plasmid pBRHl (6), (R.I. Rodriguez, et aU, Nucleic Acids 
15 Research 6, 3267-3287 [1979]) expresses ampicilin resistance and 
contains the gene for tetracycline resistance but, there being no 
associated promoter, does not express that resistance. The plasmid 
is accordingly tetracycline sensitive. 8y introducing a 
promoter-operator system in the EcoRI site, the plasmid can be made 
20 tetracycline resistant. 

pBRHl was digested with EcoRI and the enzyme removed by phenol 
extraction followed by chloroform extraction and recovered in water 
after ethanol precipitation. The resulting DNA molecule 7 was, in 
separate reaction mixtures, combined with each of the three DNA 

25 fragments obtained in part A. above and ligated with T^ DNA ligase 
as previously described. The DNA present in the reaction mixture was 
used to transform competent E. c&U K-12 strain 294, K. Backman et 
11., Proc Nat'l Acad Sci USA 73, 4174-4198 [1976]) (ATCC no. 31448) 
by standard techniques (V. Hershf ield et ah, Proc Nat'l Acad Sci USA 

30 71, 3455-3459 [1974]) and the bacteria plated on LB plates containing 
20 wg/ml ampicillin and 5 ug/ml tetracycline. Several 
tetracycline-resistant colonies were selected, plasmid DNA isolated 
and the presence of the desired fragment confirmed by restriction 
enzyme analysis. The resulting plasmid 8, designated pBRHtrp, 
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expresses e-lactamase, imparting ampicillin resistance, and it 
contains a DMA fragment including the trp promoter-operator and 
encoding a first protein comprising a fusion of the first six amino 
acids of the trp leader and approximately the last third of the trp E 
polypeptide (this polypeptide is designated IE'), and a second 
protein corresponding to approximately the first half of the trp D 
polypeptide (this polypeptide is designated D'), and a third protein 
coded for by the tetracycline resistance gene. 

C. Cloning genes for various end-product polypeptides and expression 
of these as fusion proteins comprising end-product and specifically 
cleavable trp 0 polypeptide precursor (figure 2). 



A DNA fragment comprising the trp promoter-operator and codons 
for the IE' and D' polypeptides was obtained from plasmid pBRHtrp and 
inserted into plasmids containing structural genes for various 
15 desired polypeptides, next- exemplified for the case of somatostatin 
(Figure 2). 

pBRH trp was digested with EcoRI restriction enzyme and the 
resulting fragment 5 isolated by PAGE and electrocution. 
EcoRI-digested plasmid pSom 11 (K. Itakura et al, Science 198. 1056 
20 (1977); G.8. patent publication no. 2 007 676 A) was combined with 
fragment S. The mixture was ligated with T 4 DNA ligase as 
previously described and the resulting DNA transformed into E. col i 
K-12 strain 294 as previously described. Transformant bacteria were 
selected on ampicillin-containing plates. Resulting 
25 ampicillin-resistant colonies were screened by colony hybridization 
(M. Gruenstein et al_., Proc Nat'l Acad Sci USA 72', 3951-3965 [1975]) 
using as a probe the trp promoter-operator-containing fragment 5 
isolated from pBRHtrp, which had been radioactively labelled with 
P . Several colonies shown positive by colony hybridization were 
selected, plasmid DNA was isolated and the orientation of the 
inserted fragments determined by restriction analysis employing 
restriction enzymes Bglll and BamHI in double digestion. E. coli 294 
containing the plasmid designated P S0M7a2, H_, which has the trp~ 
promoter-operator fragment in the desired orientation was grown in LB 
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medium containing 10 u g/ml ampicillin. The cells were grown to 
optical density 1 (at 550 nM), collected by centrifugation and 
resuspended in H9 media in tenfold dilution. Cells were grown for 
2-3 hours, again to optical density 1, then lysed and total cellular 

5 protein analyzed by SOS (sodium dodcyl sulfate) urea (15 percent) 
polyacrylamide gel electrophoresis (J.V. Maizel Jr. et a].., Meth 

• Viral 5, 180-246 [1971]). 

Figure 3 illustrates a protein gel analysis in which total 
protein from various cultures is separated by size. The density of 
10 individual bands reflects the quantity in which' the respective • 
proteins are present. With reference to Figure 3, lanes 1 and 7 are 
controls and comprise a variety of proteins of previously determined 
size which serve as points of comparative reference. Lanes 2 and 3 
segregate cellular protein from colonies of E. coli 294 transformed 
15 with plasmid pSom7 a2 and respectively grown in LB (lane 2) and M9 
(lane 3) media. Lanes 4 and 5 segregate cellular protein obtained 
from similar cells transformed with the plasmid P Tha7 a2, a thymosin' 
expression plasmid obtained by procedures essentially identical to 
those already described, beginning with the plasmid pThal (see the 
20 commonly assigned US patent application of Roberto Crea and Ronald B. 
Wetzel, filed February 28, 1980 for Thymosin Alpha 1 Production, the 
disclosure of which is incorporated herein by reference). Lane 4 
segregates cellular protein from E. coli 294/pTha7 a2 grown in LB 
media, whereas lane 5 segregates cell protein from the same- 
25 transformant grown in M9 media. Lane 6, another control, is the 
protein pattern of E. coli 294/pBR322 grown in LB. 

Comparison to controls shows the uppermost of the two most 
prominent bands in each of lanes 3 and 5 to be proteins of size 
anticipated in the case of expression of a fusion protein comprising 
30 the D' polypeptide and, respectively, somatostatin and thymosin (the 
other prominent band represents the LE' polypeptide resulting from 
deletion of the attenuator). Figure 3 confirms that expression is 
repressed in tryptophan-rich media, but derepressed under tryptophan 
deficient conditions. 
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D. Cyanogen bromide cleavage and radioimmunoassay for hormone product 

For both the thymosin and somatostatin cases, total cellular 
protein was cyanogen bromide-cleaved, the cleavage product recovered 
• and, after drying, was resuspended in buffer and analyzed by radio- 
5 immunoassay, confirming the expression of product immunologically 
identical, respectively, to somatostatin anoVthymosin. Cyanogen 
bromide cleavage was as described in D.V. Goeddel et ah, Proc Nat'l 
Acad Sci USA 76, 106-110 [1979]). • • 

-II. Construction of plasmids for direct expression of heterologous 
10 genes under control of the trp promoter-operator system 

The strategy for direct expression entailed creation of a 
plasmid containing a unique restriction site distal from all control 
elements of the trp operon into which heterologous genes could be 
cloned in lieu of the trp leader sequence and in proper, spaced 
15 relation to the trp leader polypeptide's ribosome binding site. The 
• direct expression approach is next exemplified for the case of human 
growth hormone expression. 

The plasmid pSon>7 a2, 10 M g, was cleaved with EcoRI and the 0NA 
fragment 5 containing the tryptophan genetic elements was isolated by 
20 PAGE and electroelution. This fragment, 2 M g, was digested with the 
restriction endonuclease Taq I, 2 units, 10 minutes at 37*C such 
that, on the average, only one of . the approximately five Taq I sites 
in each molecule is cleaved. This partially digested mixture of 
fragments was separated by PAGE and an approximately 300 base pair 
25 fragment 12 (Fig. 4) that contained one EcoRI end and one Taq I end 
was isolated by electroelution. The corresponding Taq I site is 
located between the transcription start and translation start sites 
and is 5 nucleotides upstream from the ATG codon of the trp leader 
peptide. The DNA sequence about this site is shown in figure 4. By 
30 proceeding as described, a fragment could be isolated, containing all 
control elements of the trp operon, i.e., promoter-operator system, 
transcription initiation signal, and trp leader ribosome binding 



site. 
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The Taq I residue at the 3' end of the resulting fragment 
adjacent the translation start signal for the trp leader sequence was 
next converted into an Xbal site, as shown in Figure 5. This was 
done by ligating the fragment 12 obtained above to a plasmid 
5. containing a unique (i.e., only one) EcoRI site and a unique Xbal 
site. For this purpose, one may employ essentially any plasmid 
containing,- in order, a-repljcon, a selectable marker such as 
antibiotic resistance, and EcoRI, Xbal and BamHI sites. Thus, for 
: example, an Xbal site, can be introduced between the EcoRI and -BamHI 
10 sites of pBR322 (F. Bolivar et a].., Gene 2, 95-119 [1977]) by, e.g., 
cleaving at the plasmid' s unique Hind III site with Hind III followed 
by single strand-specific nuclease digestion of the resulting sticky 
ends, and blunt end ligation of a self annealing double-stranded 
synthetic nucleotide containing the recognition site such as 
15 CCTCTAGAGG. Alternatively, naturally derived DNA fragments may be j 
employed, as was done in the present case, that contain a single Xbal 
site between EcoRI and BamHI cleavage residues. Thus, an EcoRI and 
BamHI digestion product of the viral genome of hepatitis B was 
obtained by conventional means and cloned into the EcoRI and BamHI 
20 sites of plasmid pGH6 (O.V. Goeddel et al., Nature 281 , 544 [1979])) 
to form the plasmid pHS32. Plasmid pHS32 was cleaved with Xbal, 
phenol. extracted, chloroform extracted and ethanol precipitated. It 
was theii.treated with 1 „1 E. coli polymerase I, Klenow fragment 
(Boehrjnger-Mannheim) in 30 „1 polymerase buffer (50 mM potassium 
25 phosphate pH 7.4, 7mM MgCl 2 , 1 mM s-mercaptoethanol) containing 
O.lmH dTTP and O.lmM dCTP for 30 minutes at 0*C then 2 hr. at 37*C. 
This treatment causes 2 of the 4 nucleotides complementary to the 5' 
protruding end of the Xbal cleavage site to be filled in: 

5' CTAGA 5 ' CTAGA 

30 - 3' T— > 3 - TCT > 

Two nucleotides, dC and dT, were incorporated giving an end with two 
5' protruding nucleotides. This linear residue of plasmid pHS32 
(after phenol and chloroform extraction and recovery in water after 
ethanol precipitation) was cleaved with EcoRI. The large plasmid 
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fragment 13 was separated from the smaller EcoRI-Xbal fragment by 
PAGE and isolated after electroelution. This ONA fragment from pHS32 
(0.2 pg), was ligated, under conditions similar to those described 
above, to the EcoRI-Taq I fragment of the tryptophan operon (-0.01 
jig), as shown in Figure 5. In this process the Tag. I protruding end 
is ligated to the Xbal remaining protruding end even though it is not 
completely Watson-Crick base-paired: 

T + CTAGA TCTAGA 

AGC TCT *" AGCTCT 



A portion of this ligation reaction mixture was transformed into E. 
coli 294 cells as in part I. above, heat treated and plated on LB 
plates containing ampicillin. Twenty-four colonies were selected, 
grown in 3 ml LB media, and plasmid isolated. Six of these were 
found to have the Xbal site regenerated via E. coli catalyzed DNA 
15 repair and replication: 

TCTAGA TCTAGA 

AGCTCT "~ AGATCT 

These plasmids were also found to cleave both with EcoRI and Hpal and 
to give the expected restriction fragments. One plasmid 14, desig- 
nated pTrp 14, was used for expression of heterologous polypeptides, " 
as next discussed. 



The plasmid pHGH 107 (18 in Figure 6, D.V. Goeddel et al, Nature . 
281. 544 » 1979) contains a gene for human growth hormone made up of 
23 amino acid codons produced from synthetic DNA fragments and 163 
25 amino acid codons obtained from complementary DNA produced via 
reverse transcription of human growth hormone messenger RNA. This 
gene 21, though it lacks the codons of the "pre" sequence of human 
growth hormone, does contain an ATG translation initiation codon. 
The gene was isolated from 10 ug pHGH 107 after treatment with EcoRI 
followed by E. coli polymerase I Klenow fragment arid dTTP and dATP as 
described above. Following phenol and chloroform extraction and 
ethanol precipitation the plasmid was treated with BamHI. See Figure 
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The human growth hormone ("HGH") gene-containing fragment 21 was 
isolated by PAGE followed by electrocution. The resulting DNA 
fragment also contains the first 350 nucleotides of the tetracycline 
resistance structural gene, but lacks the tetracyline 

5 promoter-operator system so that, when subsequently cloned into an 
expression plasmid, plasmids containing the insert can be located by 
the restoration of tetracycline resistance. Because the EcoRI end of 
the fragment 21 has been filled in by the Klenow polymerase I- 
procedure, the fragment has one blunt and one sticky end, ensuring 

10 proper orientation when later inserted into an expression plasmid. 
See Figure 6. 

The expression plasmid P Trpl4 was next prepared to receive the 
HGH gene-containing fragment prepared above. Thus, P Trpl4 was Xbal 
digested and the resulting sticky ends filled in with the Klenow 

15 polymerase I procedure employing dATP, dTTP, dGTP and dCTP. After 
phenol and chloroform extraction and ethanol precipitation the 
resulting DNA 16 was treated with BamHI and the resulting large 
plasmid fragment 17 isolated by PAGE and electroelution. The 
pTrpl4-derived fragment 17 had one blunt and one sticky end, 

20 permitting recombination in proper orientation with the HGH gene 
containing fragment 21 previously described. 

The HGH gene fragment 21 and the P Trpl4 AXba-BamHI fragment 17 
were combined and ligated together under conditions similar to those 
described above. The filled in Xbal and EcoRI ends ligated together 
25 by blunt end ligation to recreate both the Xbal and the EcoRI site: 



Xbal filled in EcoRI filled in HGH gene initiation 

—TCTAG + AATTCTATG — T^Ja4aTTCTATG_ 

— AGATC TTAAGATAC ^ ^(^TA^ATAC— 

Xbal EcoRI 
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This construction also recreates the tetracycline resistance gene. ' 
Since the plasmid pHGH 107 expresses tetracycline resistance from a 
promoter lying upstream from the HGH gene (the lac promoter), this 
construction 22, designated pHGH 207, permits expression of the gene 
5 for tetracycline resistance under the control of the tryptophan 

promoter-operator. Thus the ligation mixture was transformed into E. 
coli 294 and colonies selected on L3 plates containing 5 ug/ml 
"tetracycline. 

In order to confirm the direct expression of human growth 
10 hormone from plasmid pHGH 207, total cellular protein derived from 
E.coli 294/pHGH 207 that had been grown to optical density 1 in LB 
media containing 10 ug/ml ampicillin and diluted 1 to 10 into M9 
media, and grown again to optical density 1, was subjected to SOS gel 
electrophoresis as in the case of part I. above and compared to 

15 similar electrophoresis data obtained for human growth hormone as 
previously expressed by others (D.V. Goeddel et al, Nature , 281 , 544 
(1979)). Figure 7 is a photograph of the resulting, stained gel 
wherein: Lanes 1 and 7 contain protein markers of various known 
sizes; Lane 2 is a control that separates total cellular protein of 

20 E. Coli strain 294 p8R322; Lane 3 segregates protein from E. Coli 
294/pHGH 107 grown in LB media; Lane 4 segregates protein from E. 
Coli 294/pHGH 107 grown in M9 media; Lane 5 segregates protein from 
E.. Coli 294/pHGH 207 grown in LB media; and Lane 6 segregates protein 
from E. Coli 294/pHGH 207 grown in M9. The dense band in Lane 6 is 

25 human growth hormone, as shown by comparison to the similar bands in 
Lanes 2-4. As predicted by the invention, the organism E. Coli 
294/pHGH 207 when grown in tryptophan-rich LB media produces less 
human growth hormone by reason of tryptophan repressor/operator 
interactions, and when grown in M9 media produces considerably more 

30 HGH than E. Coli 294/pHGH 107 owing to the induction of the stronger 
tryptophan promoter-operator system vs the Jac promoter-operator 
system in pHGH 107. 



III. Creation of a general expression plasmid for the direct 
expression of heterologous genes under control of the tryptophan 
35 promoter-operator. 
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The plasmid pHGH 207 created in the preceding section was next 
used to obtain a DNA fragment containing the control elements of the 
tryptophan operon (with the attenuator deleted) and to create a 
plasmid u expression vector" suitable for the direct expression of 

5 . various structural gene inserts- The strategy for creation of the 
general expression plasmid involved removal of the tryptophan control 
region from pHGH 207 by EcoRI digestion and insertion in the 
EcoRl-digested plasmid pBRHl used in part I. supra. pBRHl , as 
previously noted, is an ampicillin resistant plasmid containing the 

10 tetracycline resistance gene but is tetracycline sensitive because of 
the absence of a suitable promoter-operator system. The resulting 
plasmid, pHKY 1, whose construction is more particularly described 
below and shown in Figure 8,- is both ampicillin and -.tetracycline 
resistant, contains the tryptophan promoter-operator system, lacks 

15 the tryptophan attenuator, and contains a unique Xbal site distal 
from the tryptophan promoter-operator. The tryptophan promoter- 
operator and unique Xbal site are bounded by EcoRI sites, such that 
the promoter-operator-Xbal-containing fragment can be removed for 
insertion in other structural gene-containing plasraids. 

20 Alternatively, heterologous structural genes may be inserted, either 
into the Xbal site or (after partial EcoRI digestion) into the EcoRI 
site distal from the tryptophan control region, in either case so as 
to come under the control of the tryptophan promoter-operator system. 

Plasmid pHGH 207 was EcoRI digested and the trp promoter 
25 containing EcoRI fragment j?3 recovered by PAGE followed by 
electrocution. 

Plasmid pBRHl was EcoRI digested and the cleaved ends treated 
with bacterial alkaline phosphatase ("BAP") (1 pg, in 50 mM tris pH 8 
and 10 mM MgC^ for 30 min. at 65" C) to remove the phosphate groups 

30 on the protruding EcoRI ends. Excess bacterial alkaline phosphatase 
was removed by phenol extraction, chloroform extraction and ethanol 
precipitation. The resulting linear DNA 2i» because it lacks 
phosphates on the protruding ends thereof, will in ligation accept 
only inserts whose complementary sticky ends are phosphorylated but 

35 will not itself recircularize, permitting more facile screening for 
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plasmids containing the inserts. The EcoRI fragment derived from 
pHGH 207 and the linear ONA obtained from pBRHl were combined in the 
presence of T 4 ligase as previously described and ligated. A 
portion of the resulting mixture was transformed into E. coli strain 
294 as previously described, plated on LB media containing 5 u g/ml of 
tetracycline, and 12 tetracycline resistant colonies selected. 
Plasmid was isolated from each colony and examined for the presence 
of a ONA insert by restriction endonuclease analysis employing- EcoRI 
.and Xbal. One plasmid containing the insert was designated pHKYl. 



10 IV. Creation of a plasmid conta4ning the tryptophan operon capable 
of expressing a specifically cleavable fusion protein comprising 6 
amino acids of the trp leader peptide and the last third of the trp E 
polypeptide (designated LE') and a heterologous structural gene 
product. 



15 



The strategy for the creation of a LE' fusion protein expression 
plasmid entailed the following steps: 



a. Provision of a gene fragment comprising codons for the 
distal region of the LE'polypeptide-having Bgl II and EcoRI 
sticky ends respectively at the 5' and at the 3' ends of the 

20 coding strand; 

b. Elimination of the codons from the distal region of the LE' 
gene fragment and those for the trp 0 gene from plasmid SOM 7 a2 
and insertion of the fragment formed in step 1, reconstituting 
the LE' codon sequence immediately upstream from 

25 that for the heterologous gene for somatostatin. 

1. With reference to Figure 9(a), plasmid P Som7 a2 was Hind III 
digested followed by digestion with lambda exonuclease (a 5' to 
3'exonuclease) under conditions chosen so as to digest beyond the Bgl 
II restriction site within the LE' encoding region. 20 „g of Hind 
30 Ill-digested pSom 7 a2 was dissolved in buffer [20n« glycine buffer, 
pH 9.6. ImM MgCl 2 , ImM B-mercaptoethanol ]. The resulting mixture 
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was treated with 5 units of lambda exonuclease for 60 minutes at room 
temperature. The reaction mixture obtained was then phenol 
extracted, chloroform extracted and ethanol precipitated. 

In order ultimately to create an EcoRI residue at the distal end 
5- of the LE' gene fragment a primer 32 pCCTGTGCAT6AT was synthesized 
by the improved phosphotriester method (R. Crea et j^., Proc Nafl 
Acad Sci USA 75, 5765 [1978]) and hybridized to the single stranded 
end of the LE* gene fragment resulting from lambda exonuclease 
digestion. The hybridization was performed as next described. 

10 20ug of the lambda exonucl ease-treated Hind III digestion 

product of plasmid P Som7 a2 was dissolved in 20„1 H 2 0 and combined 
with 6 P 1 of a solution containing approximately 80 picomoles of the 
5'-phosphorylated oligonucleotide described above. The synthetic 
fragment was hybridized to the 3' end of the LE* coding sequence and 

15 the remaining single strand portion of the LE' fragment was filled in 
by the Klenow polymerase I procedure described above, using dATP, 
dTTP, dGTP and dCTP. 

The reaction mixture was heated to 50*C and let cool slowly to 
10*C, whereafter 4 U 1 of Klenow enzyme were added. After IS minute 

20 room temperature incubation, followed by 30 minutes incubation at 
37 C, the reaction was stopped by the addition of 5 M 1 of 0.25 molar 
EDTA. The reaction mixture was phenol extracted, chloroform 
extracted and ethanol precipitated. The DNA was subsequently cleaved 
with the restriction enzyme Bgl II. The fragments were separated by 

25 PAGE. An autoradiogram obtained from the gel revealed a 

P-labelled fragment of the expected length of approximately 470 
bp, which was recovered by electroelution. As outlined, this 
fragment LE'(d) has a Bgl II and a blunt end coinciding with the 
beginning of the primer. 

30 The plasmid pThal described in part 1(C)' above carries a 

structural gene for thymosin alpha one cloned at its 5' coding strand 
end into an EcoRI site and at its 3' end into a BamHI site. As shown 
in Figure 9, the thymosin gene contains a Bgl II site as well. 
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Plasmid pTh«l also contains a gene specifying ampicillin resistance. 
In order to create a plasmid capable of accepting the LE'(d) fragment 
prepared above, pThal was- EcoRI digested followed by Klenow 
polymerase I reaction with dTTP and dATP to blunt the EcoRI 
residues. Bgl II digestion of the resulting product created a linear 
DNA fragment 33 containing the gene for ampicillin resistance and, at 
its opposite ends, a sticky Bgl II residue and a blunt end. The 
resulting product could be recircularized by reaction with the LE' (de- 
fragment containing a Bgl II sticky end and a blunt end in the ' 
presence of T 4 ligase to form the plasmid P Trp24 (Fig. 9b). In 
doing so, an EcoRI site is recreated at the position where blunt end 
. ligation occurred. 

With reference to Figure 10, successive digestion ofpTr P 24 with 
Bgl II and EcoRI, followed by PAGE and electroelution yields a 

15 fragment having codons for the LE'(d) polypeptide with a Bgl II 
sticky end and an EcoRI sticky end adjacent its 3' coding terminus 
The LE'(d) fragmeht 38 can be cloned into the Bgl II site of plasmid 
P Som7 a2 to form an IE' polypeptide/ somatostatin fusion protein 
expressed under the control of the tryptophan promoter-operator, as 

20 shown in Figure 10. To do so requires (1) partial EcoRI digestion" of 
P Som7 A2 in order to cleave the EcoRI site distal to the tryptophan 
promoter-operator, as shown in Figure 10 and (2) proper choice of the 
primer sequence (Figure 9) in order to. properly maintain the codon 
reading frame, and to recreate an EcoRI cleavage site. 

25 Thus, 16 ug plasmid P Som7 a2 was diluted into 200 „1 of buffer 

containing 20 mM Tris, pH 7.5, 5 mM HgCl,,, 0.02 NP40 detergent 
100 mM NaCI and treated with 0.5 units EcoRI. After 15 minutes at 
37 C, the reaction mixture was phenol extracted, chloroform extracted 
and ethanol precipitated and subsequently digested with Bgl II The 

30 larger resulting fragment 36 isolated by the PAGE procedure followed 
by electroelution. This fragment contains the codons "L£'(p)" for 
the proximal end of the LE' polypeptide, ie, those upstream from the 
Bgl II sue. The fragment 36 was next ligated to the fragment 38 in 
the presence of ONA ligase to form the plasmid P Som7 *2a4, which 
35 upon transformation into E. coli strain 294, as previously described, 
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efficiently produced a fusion protein consisting. of the fully 
reconstituted LE* polypeptide and somatostatin under the control of 
the tryptophan promoter-operator. The fusion protein, from which the 
• somatostatin may be specifically cleaved owing to the presence of a 
5 methionine at the 5' end of the somatostatin sequence was segregated 
by .SOS polyacrylamide gel electrophoresis as previously described. 
The fusion protein product is the most distinct band^pparent in Lane 
6 of Figure 11, discussed in greater detail in Part VI, infra; •• 

V. Creation of an expression system for trp IE' polypeptide fusions 
10 wherein tetracycline resistance is placed under the control of the 
tryptophan promoter-operator. 

The strategy for creation of an expression vehicle capable of 
receiving a wide variety of heterologous polypeptide genes for 
expression as trp LE* fusion proteins under the control of the 
15 tryptophan operon entailed construction of a plasmid having the 
following characteristics: 

1. Tetracycline resistance which would be lost in the event of 
the promoter-operator system controlling the genes specifying 
such resistance was excised. 

20 2. Removing the promoter-operator system that controls 

tetracycline resistance, and recircularizing by ligation to a 
heterologous gene and a tryptophan promoter-operator system in 
proper reading phase with reference thereto, thus restoring 
tetracycline resistance and accordingly permitting 

25 identification of plasmids containing the heterologous gene 

insert. 

In short, and consistent with the nature of the intended inserts, the 
object was to create a linear piece of ONA having a Pst residue at 
its 3' end and a Bgl II residue at its 5' end, bounding a gene 
30 capable of specifying tetracycline resistance when brought under the 
control of a promoter-operator system. 
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Thus, with reference to figure 12, plasmid pBR322 was Hind III 
dusted and the protruding Hind III ends in turn digested with SI 
nuclease. The SI nuclease digestion involved treatment of 10 „g of 
Hind Ill-cleaved pBR322 in 30 „1 SI buffer (0.3 M NaCl. 1 * ZnCl„ 

•S 25 mM sodium acetate, pH 4.5) with 300 units SI nuclease for 30 

^nutes at 15'C. The reaction was stopped by the additon of 1 P l of 
30 X SI nuclease stop solution (0.8M tris base,->0 mM EDTA). The 
mixture was phenol extracted, chloroform extracted and ethanol 
precipitated, then EcoRI digested as previously described and the 

10 large fragment 46 obtained by PAGE procedure followed by 

electroelution. The fragment obtained has a first EcoRI sticky end 
and a second, blunt end whose coding strand. begins with the ' 
nucleotide thymidine. As will be subsequently shown, the Sl-digested 
Hind III residue beginning with thymidine can be joined to a Klenow 

15 polymerase I-treated Bgl II residue so as to reconstitute the Bgl II 
restriction site upon ligation. 

Plasmid P Som7 a2, as prepared in Part I above, was Bgl II' 
digested and the Bgl II sticky ends resulting made double stranded 
with the Klenow polymerase I procedure using all four deoxynucleotide 
20 triphosphates. EcoRI cleavage of the resulting product followed by 
PAGE and electroelution of the small fragment 42 yielded a linear 
Piece of DNA containing the tryptophan promoter-operator and codons 
of the LE' "proximal- sequence upstream from the Bgl II site 

^'[V: ^ Pr ° dUCt h3d 30 EC0RI end and * blunt end resulting 
25 from fm,ng in the Bgl II site. However, the Bgl II site is 
reconstituted by ligation of the blunt end of fragment 42 to the 
blunt end of fragment 46. Thus, the two fragments were ligated in 
the presence of T„ DNA ligase to form the recircularized plasmid 
PHKY 10 (see figure 12) which was propagated by transformation into 
30 competent E. coHstrain 294 cells. Tetracycline resistant cells 
bear.ng the recombinant plasmid pHKY 10 were grown up. plasmid DNA 
extracted and digested in turn with Bgl II and Pst followed by 
isolation by the PAGE procedure and electroelution of the large 

v; l7 9 Tl\ a line3r PiSCe ° f ° NA haV1 " 9 Pst and 11 ^icky ends. 
35 This DNA fragment 49 contains the origin of replication and 
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subsequently proved useful as a first component in the construction 
of plasmids where both the genes coding for trp LE* polypeptide 
fusion proteins and the tet resistance gene are controlled by the trp 
promoter /operator. 

5. . Plasmid pSom7 a2a4, as previously prepared in Part IV, could be 
manipulated to provide a second component for a system capable of 
receiving a wide variety of heterologous structural genes. With 
reference to Figure 13, the plasmid was subjected to partial EcoRI 
digestion (see Part IV) followed by Pst digestion and fragment 51 

10 containing the trp promoter/operator was isolated by the PAGE 

procedure followed by electroelution. Partial EcoRI digestion was 
necessary to obtain a fragment which was cleaved adjacent to the 5' 
end of the somatostatin gene but not cleaved at the EcoRI site 
present between the ampicillin resistance gene and the trp promoter 

15 operator. Ampicillin resistance lost by the Pst I cut in the ap R 
gene could be restored upon ligation with fragment 51. 

In a first demonstration the third component, a structural gene 
for thymosin alpha-one was obtained by EcoRI and BamHI digestion of 
plasmid pThol. The fragment, 52, was purified by PAGE and 
20 electroelution. 

The three gene fragments 49, 51 and 52 could now be ligated 
together in proper orientation, as depicted in Figure 13, to form the 
plasmid pTha7AlA4, which could be selected by reason of the 
restoration of ampicillin and tetracycline resistance. The plasmid, 

25 when transformed into E. coli strain 294 and grown up under • 
conditions like those described in Part I, expressed a trp LE' 
polypeptide fusion protein from which thymosin alpha one could be 
specifically cleaved by cyanogen bromide treatment. When other 
heterologous structural genes having EcoRI and BamHI termini were 

30 similarly ligated with the pHKYlO-derived and pS0M7 4 2a4-derived 
components, trp LE' polypeptide fusion proteins containing the 
polypeptides for which those heterologous genes code were likewise 
efficiently obtamed. Figure 11 illustrates an SOS polyacrylamide 
gel electrophoresis separation of total cellular protein from E. coli 
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10 



15 



train 294 transforms, the darkest band in each case representing 
the fusion protein product produced under control of the tryptophan 
promoter-operator system. In Figure 11, Lane 1 is a control which 
segregates total cellular protein from E. coH 294/pBR322 Lane 2 
contains the somatostatin fusion product from plasmid pSom7 A 2a4 
prepared in Part IV. Lane 3 is the somatostatin-containing 
expression product of ^Som7 A 1 A 4. Lane 4 contains the expression 
product of P Tho7al A 4, whereas Lane 5 contains the product expressed 
from a plasmid obtained when the pHKY-10-derived and pSonrf * 
A 2A4-derived fragments discussed, above were ligated with an 
EcoRI/BamM tenninated structural gene encoding human proinsulin and 
prepared in part by certain of us. Lanes 6 and 7 respectively 
contain, as the darkest band, a trp LE' polypeptide fusion protein 
from which can be cleaved the B and A chain of human insulin. The 
insulin B and A structural genes were obtained by EcoRI and BamHI 
digest,on of plasmids pIBl and pIAll respectively, whose construction 
is disclosed in D.V. Goeddel et al., Proc «af 1 Acad Scj_USA 76, 106 
L^/yj. Lane 8 contains size markers, as before. 



* * * 



While the invention in its most preferred embodiment is 
20 described with reference to E. coli, other enterobacteriaceae could 
likew 1S e serve as host cells for expression and as sources for trp 
operons, among which may be mentioned as examples Salmonella 
; VPh1muHum and ^tia inajxesans . Thus, the invention is not to be 
lifted to the preferred embodiments described, but only by the 
25 lawful scope of the appended claims. 
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CLAIMS ; 

1. A method of creating an expression plasmid for the 
expression of a heterologous gene which comprises the 
simultaneous ligation, in phase, of: 

(a) a first linear double-stranded DNA fragment 
containing a replicon and a gene which expresses a 
-selectable characteristic when placed under the 
direction of a bacterial promoter, said fragment 
lacking any such promoter; 

(b) a second linear double-stranded DNA fragment 
comprising said heterologous gene; and 

(c) a third double-stranded DNA fragment which comprises 
a bacterial promoter; 

the ligatable ends of said fragments being configured such 
that upon ligation to form a replicable plasmid both the gene 
for the selectable characteristic and the heterologous gene 
come under the direction of the promoter, thus permitting use 
of the selectable characteristic in selection of transformant 
bacteria colonies capable of expressing the heterologous gene. 

2. The method of claim 1 wherein the selectable 
characteristic is antibiotic resistance. 

3. The method of claim 2 wherein the selectable 
characteristic is tetracycline resistance and wherein the 
bacterial promoter is the trp promoter. 

4. The method of claim 3 wherein ligation reconstitutes an 
operon for the expression of arapicillin resistance as well. 

5. A method of cleaving double stranded DNA at any given 
point which comprises: 

(a) converting the double stranded DNA to single- 
stranded DNA in a region surrounding said point; 
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(b) hybridizing to the single-stranded region formed in 
step (a) a complementary primer length of single- 
stranded DNA, the 5« end of the primer lying . 
opposite the nucleotide adjoining the intended 
cleavage site; 

(c) restoring that portion of the second strand • 

^ .eliminated in step (a) which lies in the 3' direction 
from said primer by reaction with DNA polymerase in 
the presence of adenine, thymine, guanine and 

cytosine-containing deoxynucleotide triphosphates; 
and 

(d) digesting the remaining single-stranded length of 
DNA which protrudes beyond the intended cleavage 
point. 



6. The method of claim 5 wherein steps (c) and (d) are 
performed simultaneously by reaction with DNA polymerase which 
polymerizes in the direction of 5' -»3', is exonucleolytic in the 
direction of 3' -> 5', but non-exonucleolytic in the direction of 5' -* 3\ 

7. The method of claim 6 wherein the polymerase is Klenow 
Polymerase I. 

8. A plasmidic expression vehicle for the production in 

E. coli bacteria of a heterologous polypeptide product, said 
vehicle having a sequence of double-stranded DNA comprising, 
in phase from a first 5' to a second 3' end of the coding 
strand thereof, the elements: 

(i) a bacterial trp promoter-operator system; 
(ii) nucleotides coding for a ribosome binding site for 
translation of element (iv) ; 
(iii) nucleotides coding for a translation start signal 

for translation of element (iv) ; and 
(iv) a structural gene encoding the amino acid sequence 
of a heterologous polypeptide; 
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said sequence comprising neither any trp attenuation 
capability nor nucleotides coding for the trp E ribosome 
binding site. 

9." The method of producing a polypeptide product by the 
expression in bacteria of a structural gene coding therefor 
which comprises: 

(a) providing a bacterial inoculant transformed with a 
replicable plasmidic expression vehicle having a 
sequence of double-stranded DNA comprising, in 
phase from a first 5 1 to a second 3' end of the 
coding strand thereof, the elements: 

(i) a bacterial trp promoter-operator system; 
(ii) nucleotides coding for a ribosome binding 
site for translation of element (iv) ; 
(iii) nucleotides coding for a translation start 
signal for translation of element (iv); and 
(iv) a structural . gene encoding the amino acid 
sequence of a heterologous polypeptide; 
said sequence comprising neither any trp attenuation 
capability nor nucleotides coding for the trp E 
ribosome binding site; 

(b) placing. the transformed inoculant in a fermentation 
vessel and growing the same to a predetermined level 
in suitable nutrient media containing additive 
tryptophan sufficient in quantity to repress said 
promoter-operator system; and 

(c) depriving said bacteria of said additive so as to 
derepress said system and occasion the expression of 
the product for which said structural gene codes. 

10. The vehicle of claim 8 or method of claim 9 wherein the 
polypeptide expressed by said structural gene is entirely 
heterologous. 
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11. The vehicle of claim 8 or the method of claim 9 wherein 
the polypeptide expressed is a fusion protein comprising a 
heterologous polypeptide and at least a portion of the amino 
acxd sequence of a homologous polypeptide. 

12. The vehicle or method of claim 11- wherein said portion is 
a portion of the amino acid sequence of an enzyme involved in 
the biosynthetic pathway from chorismic acid to tryptophan. 

13. The vehicle or method of claim 12 wherein the heterologous 
polypeptide is a bioactive polypeptide and the fused homologous 
polypeptide is a specifically cleavable bioinactivating 
polypeptide. 

14. The vehicle or method of claim 11 wherein the homologous 
polypeptide is the trp E polypeptide and wherein said ribosome 
binding site is the ribosome binding site for the trp leader 
polypeptide. 

15. The vehicle or method of claim 11 wherein the homologous 
polypeptide is the trp D polypeptide. 

16. The vehicle or method of claim 14 wherein the fusion 
protein comprises an heterologous polypeptide and a homologous 
polypeptide wbich itself constitutes a fusion of about the 
first six amino acids of the trp leader polypeptide and the 
amino acid sequence encoded by at least about the distal 
third of the trp E polypeptide gene. 

17. The vehicle or claim 8 or method of claim 9 wherein the 
heterologous polypeptide comprises a recoverable polypeptide 
selected from the group consisting of human growth hormone, 
human proinsulin, somatostatin, thymosin alpha 1, the A chain 
of human insulin and the B chain of human insulin. 



/ 
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18. The method of claim 8 wherein tryptophan deprivation is 
effected by cessation of addition of said additive and by 
dilution of the fermentation media in which said inoculant is 
first grown up. 

19. The method of claim 18 wherein the host bacteria is 
E. coli . 

20. The plasmids pBRHtrp, pSOM7A2, pHGH207, pHKYl, pSOM7A2A4, 
pThya7AlA4, and pTha7A2. 
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